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Given an i.i.d. sample drawn from a density /, we propose to test 
that / equals some prescribed density fo or that / belongs to some 
translation/scale family. We introduce a multiple testing procedure 
based on an estimation of the L2-distance between / and fo or be- 
tween / and the parametric family that we consider. For each sample 
size n, our test has level of significance a. In the case of simple hy- 
potheses, we prove that our test is adaptive: it achieves the optimal 
rates of testing established by Ingster [J. Math. Set. 99 (2000) 1110- 
1119] over various classes of smooth functions simultaneously. As for 
composite hypotheses, we obtain similar results up to a logarithmic 
factor. We carry out a simulation study to compare our procedures 
with the Kolmogorov-Smirnov tests, or with goodness-of-fit tests pro- 
posed by Bickel and Ritov [in Nonparametric Statistics and Related 
Topics (1992) 51-57] and by Kallenberg and Ledwina [Ann. Statist. 
23 (1995) 1594-1608]. 

1. Introduction. Suppose that we observe n independent and identically 
distributed (i.i.d.) real random variables X±, . . . ,X n with common unknown 
density /. Let fo be some specified density. In this paper we consider the 
problem of testing the null hypothesis "/ £ against "/ ^ where T 
equals either the singleton {fo} or the parametric family 

(1.1) F={±fo(^),MeK}, 

for some subset K of R X ]0, +oo[. 

This problem has been widely studied since the famous Kolmogorov- 
Smirnov and Cramer-von Mises tests based on the empirical distribution 
function. 
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Assuming that / belongs to L2QR), it is quite natural to construct a test 
based on the estimation of the squared L2-distance between / and T . In 
order to test the simple hypothesis "/ = /o," we actually consider a suitable 
collection of estimators of — /o) 2 and decide to reject the null hypothesis 
if some estimator in the collection is larger than its (1 — u a ) quantile under 
the null hypothesis, u a being calibrated so that the final test has level of 
significance a. We then generalize this procedure to test that / belongs to 
the translation/scale family given by (1.1). 

From a theoretical point of view, we evaluate the performances of our 
tests in terms of uniform separation rates with respect to the L2-distance 
over classes of smooth functions. Given j3 in ]0, 1[ and a class of functions 
B C L2OR), we define the uniform separation rate p(& a ,B,f3) of a level a 
test <I> Q of the null hypothesis "/ £ •T 7 " over the class B as the smallest 
number p such that the test guarantees a power at least equal to (1 — (3) for 
all alternatives / in B at a distance p from T . More precisely, denoting by 
d,2 (/, J-) the L2-distance between / and T and by P^ the distribution of the 
observation [X\ , . . . , X n ) , 

= inf{p>0, V/Gfi, <h(f,F)>p^>Ff(* a rejects) > 1-/3}. 

Assuming that / belongs to £>, the uniform separation rate p($ a ,B,(3) is 
asymptotically related to the minimax rate of testing p n introduced by Ing- 
ster [14] and referred to as the critical radius. Indeed, by definition, p n — ► 
as n — > +00 and satisfies: 

(a) For any sequence p' n such that p' n / p n = o n (l), 
inf(supP f ($ n = l) + sup P / ($ n = 0)j = l-o n (l), 

where the infimum is taken over all tests $ n with values in {0, 1} rejecting 
the null hypothesis "/ S J- v when $ n = 1 . 

(b) For any a, (3>Q, there exist some constant C > and some test <3?* 
such that the two following inequalities hold: 

(1.2) su P P / ($; = l)<Q + o n (l), 

(1.3) sup P/($; = 0)</3 + o n (l). 
feB,d2(f,r)>c Pn 

Since the goodness-of-fit test to some specified density /o can be reduced 
to a test of uniformity on [0,1] for the variables Fq(Xi) (where Fq is the 
distribution function associated with the density /o), many papers are de- 
voted to the problem of testing uniformity on [0,1]. The main reference for 
the computation of minimax rates of testing for this problem is the series 
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of papers due to Ingster [14]. In particular, under the prior assumption that 
/ belongs to some Holder class with smoothness parameter s > 0, Ingster 
establishes the minimax rate of testing p n = n~ 2s /^ s+1 \ But the tests pro- 
posed to ensure the achievement of this rate [namely the inequalities given 
in (1.2) and (1.3)] are structurally based on the prior assumption; this is 
a crucial problem for their practical application since the smoothness pa- 
rameter s is typically unknown. Following the work of Spokoiny [23] in the 
Gaussian white noise model, Ingster [15] focuses on the problem of finding 
an adaptive (assumption- free) test of uniformity on [0,1]. He proves that 
adaptation is not possible without some loss of efficiency of the order of an 
extra log log n factor and he presents an adaptive test which is based on 
chi-square statistics. 

Other methods having Neyman's test as starting point are proposed in 
order to avoid using any prior assumption on the smoothness of /. To test 
uniformity on [0, 1], Neyman [20] suggests considering some orthonormal ba- 
sis {(pi, I > 0} of L2QO, 1]) with 4>o = I[o,i] an d rejecting the null hypothesis 
"/ = I [0jl] " if the estimator E£i(E?=i UXi)/nf of 9 D = ££i(/[o,i] f<Pi) 2 
is large enough, where D is some given integer. Bickel and Ritov [4], Led- 
wina [19] and Kallenberg and Ledwina [17] introduce data-driven versions of 
Neyman's test where the parameter D is chosen via some penalized criterion. 
Inglot and Ledwina [13] establish theoretical results for the test described in 
Kallenberg and Ledwina [17]. These results which essentially deal with the 
asymptotic efficiency of the test with respect to the Neyman-Pearson test do 
not, however, lead to any optimality of the uniform separation rates. Fan [8] 
also proposes a new version of Neyman's test based on wavelet thresholding 
to test that the mean of a Gaussian vector equals with applications to 
goodness-of-fit tests in a density model. When we test uniformity on [0, 1], 
our method amounts to considering, for all integer D in some set T> n , the 
unbiased estimator of 9d defined by 

-. D n 

and to penalizing this estimator by its (1 — u a ) quantile under the null 
hypothesis. The main difference between our method and the testing pro- 
cedures proposed by previous authors lies in the order of magnitude of the 
penalty term. While Ledwina [19] and Kallenberg and Ledwina [17] choose 
the parameter D by using Schwarz's Bayesian information criterion (BIC), 
Kallenberg [16] gives a discussion of the choice of the penalties for data- 
driven Neyman's tests. But the criteria considered in these papers have been 
introduced to estimate the density / itself, whereas our penalties correspond 
to the ones used to build adaptive estimators of J R f 2 by model selection in 
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[18]. This choice allows us to obtain optimal uniform separation rates with 
respect to the L2-distance. 

As for testing a composite null hypothesis, Pouet [22] proves that pro- 
vided that / belongs to L2QO, 1]) and some Holder class, the minimax rate 
of testing is comparable to the rate for the simple hypothesis "/ = Ir 0j i] •" 
However, the test proposed by Pouet depends on the smoothness assump- 
tion on /, which is not satisfactory from an experimental point of view. 
Inglot, Kallenberg and Ledwina [12] introduce a procedure using no prior 
information about the smoothness of / to test composite hypotheses like 
"/ G {f(x,f3), (3 G 2?}" with B C W. This procedure, generalizing Kallen- 
berg and Ledwina's one [17], also consists of a combination of Neyman's 
smooth test and Schwarz's selection rule. Its construction is based on the 
consideration of a sequence of exponential families with increasing dimen- 
sions to describe departures from the null model. The "right" dimension 
is selected by an extended Schwarz's rule, which is obtained by inserting 
the maximum likelihood estimator (3 of (3 under the null hypothesis in the 
original definition of Schwarz's BIC. The next step is the application of 
Neyman's smooth test using a quadratic score statistic in the selected di- 
mension. Inglot, Kallenberg and Ledwina [12] prove the consistency of the 
test at essentially any alternative. 

The approach considered in the present paper has been initiated by Ba- 
raud, Huet and Laurent [1, 2, 3] for the problem of testing linear or qual- 
itative hypotheses in the Gaussian regression model. The properties of the 
testing procedures proposed here are nonasymptotic. For each n, the tests 
have the desired level of significance and we characterize some sets of alter- 
natives over which they have a prescribed power. For the problem of testing 
goodness-of-fit of some specified density, we state in Section 2 that our pro- 
cedure is adaptive over some collection of classes of smooth functions in the 
sense that it achieves the optimal "adaptive" rate of testing established by 
Ingster [15] over all the classes of the collection simultaneously. We also in- 
vestigate in Section 4 the test from a practical point of view by Monte Carlo 
experiments. The results show that our procedure is competitive with the 
ones due to Bickel and Ritov [4] or Kallenberg and Ledwina [17]. For the 
problem of testing the hypothesis "/ G !F" where T is the translation/scale 
family defined by (1.1), we get in Section 3 uniform separation rates over 
classes of smooth alternatives of the same order (up to a logarithmic factor) 
as the rates obtained when testing the simple hypothesis "/ = /o-" We finally 
implement the procedure to test exponentiality in Section 4; we can notice 
that it gives particularly good results in comparison with the Kolmogorov- 
Smirnov test under oscillating alternatives. The proofs of the results stated 
in the paper are detailed in Section 5. 
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2. A goodness-of-fit test. Let X\, . . . ,X n be i.i.d. random variables with 
common density / with respect to the Lebesgue measure on R. Let /o be 
some given density in L2(R) and let a be in ]0, 1[. Assuming that / belongs 
to L2(R), we construct a level a test of the null hypothesis "/ = /o" against 
the alternative "/ 7^ /o" from the observation (X\, . . . ,X n ). 

In the following, || • H2 and (•,•), respectively, denote the usual norm and 
scalar product in L2QR). For any bounded function g, \\g\\oo = sup xgIK |^(a;)|- 

2.1. Description of the test. Our test is based on an estimation of the 
quantity ||/ - f \\% that is ||/||| + ||/ ||| - 2(/,/ ). Since (/,/ ) is usually 
estimated by the empirical estimator J2i=i fo(Xi)/n, the key point is the 
estimation of H/Hf- As in [18], we introduce an at most countable collection 
{S m , m G A4} of linear subspaces of IL,2(R). For all m in M, let {pi , I G C m } 
be some orthonormal basis of S m . The variable 

1 n 

(2-1) m = - E PIMM*,) 

is an unbiased estimator of ||n5 m (/)||2, where Hs m denotes the orthogonal 
projection onto S m . Then ||/ — /o||| can be estimated by 

2 n 

T m = m + H/0II2 E /o(Aj), 

n 

i=i 

for any m in M.. Denoting by t m (u) the (1 — u) quantile of the law of T m 
under the hypothesis "/ = /o" and considering 

(2.2) u a = sup(n G ]0, 1[, P /o ( sup (f m - t m {u)) > o) < a), 

we introduce the test statistic T a defined by 

T a = sup (T m - t m (u a )). 

Our test consists of rejecting the null hypothesis if T a is positive. 

In practice, the values of u a and the quantiles {t m (u a ),m G M.} are esti- 
mated by Monte Carlo experiments under fo as explained in Section 4. 

This method amounts to a multiple testing procedure. Indeed, for each 
m in M., we construct a level u a test of the null hypothesis "/ = /o" by 
rejecting this hypothesis if T m is larger than its (1 — u a ) quantile under 
the hypothesis "/ = /o-" We thus obtain a collection of tests and we decide 
to reject the null hypothesis if for some of the tests of the collection this 
hypothesis is rejected. 



G 



M. FROMONT AND B. LAURENT 



2.2. The power of the test. Let us now describe the collection of linear 
subspaces {S m ,m £ M.} that we use to define our testing procedure here. 
This collection is obtained by mixing spaces generated by constant piece- 
wise functions, scaling functions and, in the case of compactly supported 
densities, trigonometric polynomials. 

(i) For all D in N* and k in Z, let 

lD,k = ^D^[k/D,(k+l)/D[- 

For all D in N*, we define £(i,d) as the space generated by the functions 
{lD,ki k £ Z} and 

1 n 

9{1 ' D) = n(n-l) ^ ^ lD M X i) T D,k( X j)- 
^ ' k£Zi^j=l 

(ii) Let us consider a pair of compactly supported orthonormal wavelets 
((p,ip) such that for all J in N, {tp J)k = 2 J / 2 y?(2 J ■ - k),k £ Z} U {ip jik = 
2i/ 2 ^(2J • - k), j £ N, j > J, k £ Z} is an orthonormal basis of L2 (M) . For all 
J in N and D = 2 J , we define #(2,13) as the space generated by the scaling 
functions {if j t k,k £ Z} and 

1 n 

ri[ > n L > k&i^j=i 

(iii) Let us consider the Fourier basis of lL-2([0, 1]) given by 

50 (z) =I [0i i](x), 
52p-i(^) = v / 2cos(27rpx)I[o i i](3;) for all p > 1, 
92p{x) = V / 2sin(27rpx)I[o ! i](a;) for all p > 1. 

For all Z) in N*, we define S( 3 D ^ as the space generated by the functions 
{g h l = 0,...,D} and 

, D n 

We want here to notice that the constants involved in the following may 
depend on the chosen scaling function ip, but we will not always specify it. 

Introduce Bi = B 3 = N* and B 2 = {2 J , J £ N}. For I in {1,2,3}, DinBj, 
II^j D) denotes the orthogonal projection onto S^d) m ^(M). 

For all / in {1, 2, 3}, we take V x C B; with Ue{l,2,3} ^ 7^ and £>3 = if 
the Xi's are not included in [0, 1]. Let 

M = {(I, D),l£ {1,2,3}, D£V t }. 
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For all m in A4, we set 

2 n 

T m = 9 m + H/0II2 5^ fo(Xi)- 

The test statistic that we consider is 

(2.3) T a = sup (f m - t m (u a )), 

where t m (u a ) is defined in Section 2.1. 

The aim of the following theorem is to describe classes of alternatives over 
which the corresponding test has a prescribed power. 

Theorem 1. Let X\, . . . ,X n be i.i.d. real random variables with com- 
mon density f and let /o be some given density. Let T a be the test statistic 
defined by (2.3). Assume that Jq and f belong to L 00 (M) and fix some (3 
in ]0, 1[. For any e in ]0, 2[, there exist some positive constants C\{(3) and 
C2(/3,e, ll/Hoo) ll/olloo) such that, setting for all m = (I , D) inM., 

n V n J n 

if f satisfies 

||/ - /o||! > (1 + e) inf {||/ - n Sm (/)||2 + tm {u a ) + V m (j3)}, 

then 

P/(T a <0)</3. 

Comments, (i) Let us see what is the advantage of considering a mul- 
tiple testing procedure. We deduce from Theorem 1 that if we fix some 
element m in M and if we focus on the test that rejects the null hypothesis 
when T m is larger than its (1 — a) quantile under the hypothesis "/ = /o" 
denoted by t m (a), then the error probability of the second kind of the test 
is smaller than (3 for all / such that 

||/ - f \\l > (1 + e){||/ - Yl Sm (f)\\l + t m (a) + V m (P)}. 

For the multiple testing procedure, the right-hand side of the above inequal- 
ity is replaced by its infimum over all m in M, at the price that t m (a) 
is replaced by t m (u a ). When we evaluate the uniform separation rates, we 
show that for the collections {S m , m G M} that we have chosen, the quan- 
tities t m (a) and t m (u a ) just differ by a logarithmic factor. Therefore, the 
multiple testing procedure behaves almost as well as the best test among 
the considered collection of tests. 
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(ii) The key point of the proof of Theorem 1 is an exponential inequality 
for [/-statistics of order 2 due to Houdre and Reynaud-Bouret [11]. The 
same result could also be obtained with an inequality due to Gine, Latala 
and Zinn [10]. 

(iii) We prove in Section 5 that if M is finite, then for all m = (I, D) in A4, 
tm(u a ) is precisely of order y/ Dlog(\M.\/a) /n, where \A4\ denotes the car- 
dinality of Ai . This allows us to establish optimal uniform separation rates 
over various classes of alternatives. Furthermore, considering the problem of 
testing uniformity on [0, 1], if we take a collection {S m ,m £ M} which only 
contains a finite number of spaces generated by constant piecewise functions, 
we can thus see that our procedure is very close to the one proposed by In- 
gster [15]. This would therefore be satisfactory enough from a theoretical 
point of view. Our choice to use a collection of mixing spaces generated by 
constant piecewise functions, scaling functions and possibly trigonometric 
polynomials is in fact explained by the experimental results. We indeed no- 
ticed in the simulation study that such a choice mostly increases the power 
of the test. 

2.3. Uniform separation rates. Our purpose in this section is to evaluate 
the uniform separation rates of the test proposed above over several classes 
of alternatives. For s > 0, R > 0, M > and I £ {1, 2, 3}, we introduce 

Bi l \R,M) 

= {/ € L 2 (R), VD E B h ||/ - U S{LD) (f)\\l < R 2 D~ 2s , H/IU < M}. 

These sets of functions include some Holder balls or Besov bodies. To be 
more precise, we consider, for all s > and R > 0, the class of functions 
H a (R) defined by 

W s (fl) = {/:[0,l]->R,Vx,y€[0,l], 

(2.4) 

\f { ^(x)-f^(y)\<R\x-yn, 

where s = si + S2, si £ N and S2 G]0, 1]. 

Let for j in N, k in Z, (3j t k{f) = {fi^j,k)- F° r all s > and R > 0, we 
define the Besov body B s ,2,oo(-R) as 

B s ,2,oo(R) = If GL 2 (1), Vjg N,£/3? fc (/) < R 2 2- 2 A. 

I fcez J 

Then, one can see by straightforward computations that for s 6]0, 1], R > 0, 
M>0, 

H s (R)n{f, ||/|U<M}cBW(i?,M) n Bi 3 HR/y/2(4*-l),M), 
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and for s > 0, R > 0, M > 0, 

B s ,2,oo(R) n {/, H/lloo < M} C (R/VT^~ S ,M). 
The following corollary gives upper bounds for the uniform separation 
rates of our testing procedure over the classes B { s\r,M). 

Corollary 1. Let T a be the test statistic defined by (2.3). Assume that 
n>16 and that for I in {1,2,3}, £>/ is {2 J ,0 < J < log 2 (re 2 /(loglogre) 3 )} 
or 0. Fix some (3 in ]0, 1[. For all s > 0, M > 0, R> and I £ {1,2,3} such 
that T>i 0, there exists some positive constant C = C(s, a, (3, M, ||/o||oo) 
such that if f belongs to the set B^P (R, M) and satisfies 

4s/(4s+l) 



11/ 



/oi>c(^+ i )( vlog re logre 



^ 2 / (loglogra) 3 \ 2s (log log re) logn 
re 2 / n 



then 



\ 2s/(4s+l) 



P/(T a < 0) < 0. 

In particular, if 

(2.5) (loglogre) s+1/2 (logre) 2s+1 / 2 / v / ^<ii<7^ 2 7(loglog7i) 3s+1 / 2 , 

i/jere exists some positive constant C'(s,a, (3,M, ||/o||oo) suc/z £/iaf f/ie uni- 

/orm separation rate of the test Ir Q >o over Bs\r,M) satisfies 

p(I Ta>0 ,B®(R,M),(3) 

<C\s,a,P,MA\f \UR^)(^l 

V re 

Comments, (i) For the problem of testing the null hypothesis "/ = 
l[0,il" against the alternative "/ = + 9 with 9 7^ and g G B S (R)" where 
B S (R) is a class of smooth functions (like some Holder, Sobolev or Besov ball 
in L2QO, 1])) with unknown smoothness parameter s, Ingster [15] establishes 
that the adaptive minimax rate of testing is of order (\/log log n /n) 2s l ( 4s+1 ) . 
From Corollary 1, we thus deduce that the procedure that we propose is 
adaptive in the sense that it is rate optimal over all the classes bP(R,M) 
such that R belongs to the range given by (2.5) simultaneously. 

(ii) Ingster [15] considers in fact the minimax rates of testing with respect 
to general L p -distances. In particular, for 1 < p < 2, he obtains the same 
adaptive minimax rate of testing (\/log log n /n) 2s l ( 4s+1 ) . Our results can 
clearly be extended to L p -distances with 1 < p < 2 when / and /o have 
bounded support. In this case, one actually has that ||/ — /o|| p < C(p)\\f — 
/oik- 
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We focus here on classes of alternatives that are well approximated by 
their projections onto the spaces {S m ,m £ M.} under consideration. In the 
particular case where fo = I[o,i]i ° ne can see that the test may be powerful 
even for alternatives that do not have such approximation properties. This 
is the purpose of Corollary 2. 



Corollary 2. Let fo = I[o,i] - Assume that n is larger than 16 and that 
M = {(1, D),D € £>i} with V 1 = {2 J ,0 < J < log 2 (n 2 /(loglogn) 3 )}. Let T a 
be the test statistic defined by (2.3). For all s > and R > 0, consider TL S {R) 
given by (2.4). Fix some (3 in ]0, 1[. For all s > 0, M > 0, R> 0, there 
exists some positive constant C(R,s,a,j3,M) such that if f belongs to the 
set 7i. s (R) with ||/||oo < M, and if f satisfies 



/Vloglogn\ 4s /( 4s+1 ) 



\\f-f \\i>C(R,s,a,p,M) r * n ta j 

then 

P/(T a <0)</3 



Comment. The key point of the proof is an inequality due to Ingster 
([14], part III, inequality (5.16)). This inequality allows in fact to avoid 
evaluating the approximation terms ||/ — Hs TO (/)||2- Although the functions 
/ in 7i s (R) are not well approximated by their projections onto the spaces 
{S(i t £)-),D G T>i} when s > 1, we thus prove that the corresponding testing 
procedure still achieves the adaptive minimax rate of testing over 7i s (R) for 
any s > 0. 



3. Testing a parametric family. Let X\ X n be i.i.d. real random vari- 
ables with common density /. In this section we consider the problem of 
testing that / belongs to some translation/scale family of the form 

where fo is some given density and K is some subset of R x ]0, +oo[. The 
families of Gaussian, uniform or exponential densities and translation models 
are typical examples of such translation/scale families. 



3.1. Description of the test. The testing procedure introduced below is 
essentially based on the idea that if / belongs to J 7 , there exists (//, a) in K 
such that the density of the variables (Xi — pi) jo is /o- As in Section 2.1, 
we take an at most countable collection {S m ,m £ M} of linear subspaces 
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of L2(M). For all m in M, we consider an orthonormal basis {pi,l 6 C m } of 
S m composed of right-continuous functions and we set 

T m (Xi, . . . ,X n ) 

1 ^ ^ f Xi — fi\ ( Xj — fj, 



(3-1) = inf V-r V F Pi — 



Pi 



M^\n{n-l) ^ m .^ =l \ a J \ a 

Since the functions are right-continuous, the infimum over a) in if can 
be replaced by the infimum over (/x, <r) in if n Q 2 so that T m {X\ , . . . , X n ) is 
a random variable. 

We reject the null hypothesis "/ £ J 7 " if 

T Q = sup (T m (X 1 ,. . . , X n ) - q m>a ) 

is positive, where {q m ,on m £ M} is a family of positive numbers such that 
(3.2) supP/(f a >0) <a. 

Let us explain how we choose {q m ,a, fn S M}. We distinguish two cases. 

(i) The first one corresponds to the case where for all m in Ai , the variable 
T m (Xi, . . . ,X n ) defined by (3.1) satisfies 

(Xl-fJL X n -^ 



(3.3) V(n,a)€K T m {X u . . . ,X n ) = T n 



This equality holds if, for instance, K = R X ]0, +oo[, K = {0} x ]0, +oo[ or 
K = R x {1}. In this case, we take q m ,a = t m (u a ), where t m (u) is the (1 — u) 
quantile of T m (Xi, . . . ,X n ) under the hypothesis "/ = /o," and u a is taken 
as 



II r 



supine ]0,1[, F fo ( sup (f m (X u ...,X n )-t m (u))>o) <a). 



The quantities u a and {t m (u a ),m 6 M} can be estimated by Monte Carlo 
experiments. Let us see how this choice leads to inequality (3.2). Under the 
null hypothesis, there exists (u, a) in K such that the density of the variables 
{Xi — (J,) /a is /q. From (3.3), one can then deduce 



/( sup (f m (X 1 ,...,X n )-t m (u a )) >0) 



V( sup (f m ,(^-^,...,^-^)-t m (n a ))>0 



Vo ( SU P {Tm(Xl, . . . , X n ) i m (u a )) > 
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and according to the definition of t m (u a ), this probability is at most a. 

(ii) The second one corresponds to the case where (3.3) is not satisfied. 
This occurs, for instance, if K is a compact set of R x ]0, +oo[. Here, we take 
Qm,a = t m (u a ) where, as in Section 2.1, t m (u) is the (1 — u) quantile of the 
variable 

1 n 2 n 

f m (Xx,...,X n )= ]T J2 Pi(X i )pi(X 1 ) + \\f \\ 2 2 --J2fo(X i ) 

under the assumption that the variables X±, . . . ,X n are i.i.d. with common 
density /o, and u a is defined as 

u a = sup\ ue}0, 1[, P/ ( sup (f m (Xi,. . .,X n ) - t m (u)) > ) < a\. 

I \rneM / J 

Inequality (3.2) also holds in this case: if / belongs to J-, there exists (/i,<r) 
in K such that / = /o((- — fi)/a)/a. By definition of T m (Xi, . . . ,X n ), one 
has the inequality 

T m (Xi, . . . , X n ) < 

\ a a J 



Hence, 

P, 



(f a > 0) < F f ( sup (f m (^A • • • , ^—^j - t m (u a )j > o) . 



Since the variables (Xj — /x) /a have /o as common density, it follows from 
the definition of u a that the above quantity is smaller than a. 

We shall remark that the choice of {qm,on m £ M} proposed in (ii) may 
lead to a conservative procedure. It is therefore preferable to use the proce- 
dure proposed in (i) whenever condition (3.3) holds. 

3.2. The power of the test. In the following, we use the same notation 
as in Section 2.2. 

(i) For all D in Di = N* and m = (1, D), we define 

1 n 2 n 

T m (Xi, . . . , X n ) = — — — — ^2 ^2 I D,k( x i)lD,k(Xj) + H/0II2 ^2fo(Xi). 

n[n L > k&i^j=l n i=l 

(ii) Choose the scaling function <p such that it satisfies the Lipschitz con- 
dition 

\/x,y£R \tp(x) - tp(y)\ < C<p\x - y\. 

For all D = 2 J in B 2 = {2 J , J G N} and m = (2, £>), we define 

1 n 2 n 

f ni (X 1 ,...,X n ) = - -]T ^ ^ Jifc (X 4 )^j, fc (X J ) + ||/o|||--E/o(^)- 
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We recall that we do not specify the dependence on 99 in the involved con- 
stants. 

Let Vi C Pi and V 2 C B 2 such that V 1 UV 2 ^0 and let 

M = {(l,D),le{l,2},DeV l }. 

For all m in Ai, we set 



T m (Xi , . . . , X n ) — inf T m 



a 



a 



We consider the test statistic 



(3.4) 



T a = sup (T m (Xi, . . .,X n ) - q m , a ), 



where {q m ,a, tn G -M.} is a family of positive numbers satisfying (3.2). 



Theorem 2. Let X±, . . . , X n be i.i.d. real random variables with common 
density f G Loo(R). Let 



a 



/' 



(ji, a) G K 



where fo is some given bounded density and K = \pL,~p] x [a, a], n,~p,,g_,!j 
being real numbers such that a > 0. Suppose that the following hypotheses 
hold: 

(hi) There exists some constant Cf > such that for all x,y in the sup- 
port of f , (fJ,,cr) in K, (fj,',a') in K, 



k 



x — fl 
a 



fo 



a' 



<C 



fo 



x-n y - n 



a 



a' 



(/12) There exist v > and c > such that for all k>2, 

n\Xi\ k ) < ^c k -\ 

Let T a be the test statistic defined by (3.4). Assume that T>2 7^ and that n 
is large enough so that n > 3 and n 2 (JZ — /x) A (a — a) > 2. Fix some (3 in ]0, 1[. 
For all e in ]0, 2[ ; there exists some positive constant C = C(n,~p,a,a, ||/o||oo, 
C/o' ll/lloo, v, c, (3, e) such that, setting 



V D (f3) = C 
if f satisfies 



n 



log(ra 2 D) + ^ log 2 {n 2 D) + 



\og(n 2 D) 



71- 



n 



inf a 



1 



f--fo 
a 



/' 



a 



14 



M. FROMONT AND B. LAURENT 



>(!+£) inf { sup \\af(a. + »)-Il S(2>D) (af(a.+fi))\\ 2 2 



+ V D ((3) + q {2jD)j 



a 



} 



then 



P/(T a <0)</3. 



Comments, (i) In Theorem 2, we only consider the case where K is 
compact. This is due to technical reasons: we have to evaluate the supremum 
over (fi, a) in K of [/-statistics of order 2 depending on (/x, a) . By considering 
a finite grid on the compact set K, we reduce the problem to control of a 
finite number of these [/-statistics and we can use the inequality due to 
Houdre and Reynaud-Bouret [11] again to control each of them. 

(ii) The condition (hi) is satisfied by families of Gaussian densities (when 
/o(x) = e~ x / 2 /\/27r) whatever the support of / and also by families of 
exponential densities (when fo(x) = e~ x I x >o and K C {0} x ]0, +oo[) when 
the support of / is included in [0, +oo[. As for families of uniform densities 
(when /o = ^[o,i])> the condition (h±) is not satisfied but the result still holds; 
to see this, we refer to a theorem stated in [9] , where this condition is replaced 
by some L2-entropy with bracketing condition on T . 

(iii) As pointed out by a referee, the condition (h 2 ) can be slightly weak- 
ened. Bernstein's inequality used in the proof of Theorem 2 actually still 
holds when E[e iXl ] < e c '* 2 / 2 for < t < T (see [21], Section 2.2). 

3.3. Uniform separation rates. As in Section 2.3, Theorem 2 allows us 
to evaluate the uniform separation rates of our test over classes of smooth 
functions. For all s > 0, R > 0, M > 0, we consider the set 



B a (i2,M) = {/GL 3 (M),||/|| 0O <M, VDGl 2 , V(ji,<r)€K, 

\\af(a. + M ) - II 5(2 (af(a. + ^))|| 2 < B?a^D~^}. 



Such a set contains, among others, the functions / belonging to some Besov 
ball and satisfying the inequality ||/||oo < M. To see this, in the notation of 
DeVore and Lorentz [7], we introduce, for all h > and r £ N*, 




k=0 




0<h<t 
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Then / belongs to the Besov ball B S ^ 2)00 (R) if for r = [s] + 1, 

supt~ s uj r (f,t) 2 < R. 
t>o 

One can easily see that 

uj r (af(a. + n),t) 2 = a 1/2 uj r (f, at) 2 . 

Let us now recall an inequality due to DeVore, Jawerth and Popov [6]: if 
the wavelet tp satisfies that for all j < r, J x 3 ij)(x) dx = 0, then, for every 
function g in L2(M), for all j > 0, 

fcgZ 

where C is an absolute constant. Hence, if / belongs to the Besov ball 
B St 2,oo(R), for all J > 0, for all (//, a) £ K, 

\\af(a. + /i) - n 5 (af(a. + //)) ||* = £ £ + //)) 

j>Jk£l 

<C{l-4T s )~ l R 2 a 1+2s 2~ 2Js . 

If, in addition, ||/||oo <M, then / G B s (C l l 2 (\ — 4 _s ) _1 / 2 i?, M). 

The following corollary gives upper bounds for the uniform separation 
rates of the testing procedure over the classes B S (R,M). 

Corollary 3. Assume that the conditions of Theorem 2 are satisfied. 
Let T a be the test statistic defined by (3.4) with q m ,a = tm{ua) as explained in 
Section 3.1 [case (ii)]. Choose V 1 C {2 J , < J < log 2 (n 2 )} andV 2 = {2 J ,0< 
J < log 2 (ra 2 /log 3 re)}. Let (3 g]0, 1[. For all s > 0, M > 0, R> 0, there exists 
a positive constant C = C(fi,JI,a,o : , ||/o||oo,C/ ,M, z^, c, a,/3, s) such that, if 
f belongs to the set B S (R,M) and satisfies 



inf 

(/li,<t)6K 



/--/< 



1 t (--V 





a \ a 



2 



2 



> c ( R 2/(4s + i) (V^gn\ 4s/{4s+1) + R 2 ( (fog^)! \ 2s + ]ogn\ 
V \ n J \ n 2 J n J 

then 

P/(T a <0)</?. 

Comments, (i) When R satisfies i}ogn) s+1 l 2 / \/n < R < n 2s /(logra) 3s+1//2 , 
the uniform separation rate over the class of functions belonging to B S (R, M) 
and satisfying (hi) and (h 2 ) is bounded from above by 

)1/(4s+1) /Vfo^\ 2s/(1+4s) 



C"(£t, fj,, a, cr, ll/olU, Cf M, v, c, a, /3, s)R 



n 
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This corresponds, up to a logarithmic factor, to the rate over the classes 
B$\r,M) for the test of simple hypotheses obtained in Corollary 1. We do 
not know if this logarithmic factor can be avoided. 

(ii) As in Corollary 1, the result can be extended to L p -distances with p 
in [1,2] when / and /o have bounded support. 

4. Simulation study. 

4.1. Test of uniformity on [0,1]. We first present simulation results for 
the problem of testing that the distribution of some i.i.d. random variables 
X±, . . . ,X n with values in [0,1] is uniform on [0,1]. In order to implement 
our procedure, we have to choose the set M. = {(l,D), I £ {1, 2, 3}, D G V{\ 
that occurs in the definition (2.3) of the test statistic T a . We present two 
cases. In the first case, we consider only trigonometric polynomials. We take 
T>x = T>2 = and P3 = {1, 2, . . . , D tT }. Setting go = Ij ,i] and for all p > 1, 

92p-i{x) = v2cos(2p7ra;)Iro i i](a;), and g2 P (x) = v / 2sin(2p7rx)I[ 0i i] (x), the test 
statistic is based on orthogonal projections onto the spaces spanned by the 
functions {<?/,/ = 0, ...,D} for D in D3. The second case consists in mix- 
ing trigonometric polynomials and constant piecewise functions: we take 
T>i = {2, 3, ... , Dd} and V3 = {1, . . . , D tr }. The tests corresponding to these 
two cases are, respectively, denoted by T tT and T tr /ct- We compare their pow- 
ers with the powers of the tests proposed by Kallenberg and Ledwina [17] 
(denoted by Tkl), Bickel and Ritov [4] (denoted by Tbr) and Kolmogorov 
and Smirnov (denoted by Tks)- As explained in the Introduction, Kallen- 
berg and Ledwina propose a test of uniformity on [0, 1] which is a data- 
driven version of Neyman's test [20] . They consider the orthonormal system 
{4>U ' > 0} of L2QO, 1]), where 4>o = Ir 0) i] and the 4>i 7 s for / > 1 are the or- 
thonormal Legendre polynomials on [0,1]. They decide to reject the null 
hypothesis if the statistic Tb = J2E=i( n ~ X)iLi0zpQ)) 2 is large, D being 
chosen in {1, . . . , d(n)} via Schwarz's Bayesian Information Criterion. The 
critical value is estimated by simulations. The test proposed by Bickel and 
Ritov is based on the statistic 

T n = max {T n , D -D)/V2D, 

l<D<d(n) 

where T n>D = i Ya=i E?j=i 2 cos^ttX;) cos^ttX,-). 

We focus on the alternatives studied in the paper by Kallenberg and 
Ledwina [17], for which the power of Bickel and Ritov's test is also given. 
These alternatives are 

f(p,j)( X ) = 1 + PCOs(jTTx), 

9( P ,q,e)(x) = 1 - e + e(3 p>q (x), 

hpj)( x ) = 1 + P ( t'j(x), 
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where f3 Piq is the Beta density with parameter (p,q) and {4>j,j > 1} is the 
family of orthonormal Legendre polynomials on [0, 1] . 
We have chosen a level a = 5%. 

The value of u a and the quantiles {t m (u a ), m G M.} are estimated by 
40,000 simulations. We use 20,000 simulations for the estimation of the 
(1 — u) quantiles t m (u) of the variables T m = 9 m + \\f0W2 ~~ 2n _1 J2i"=i /oPQ) 
under the hypothesis "/ = /q" for u varying on a regular grid of ]0,a[ and 
20,000 simulations for the estimation of the probabilities Pj (sup mg _ A/1 (T m — 
t m (u))>0). 

Tables 1 and 2 present the estimated powers for the tests T tr , T tr / ct , Tkl, 
7br and 7ks under various alternatives for a number of observations equal to 
50 or 100. The powers of the tests T tr , T tr / Ct and Tks are estimated by 5000 
experiments and the levels by 20,000 experiments. Hence, with confidence 
95%, the estimation error is less than 0.3% for the levels and less than 1.3% 
for the powers. 

For a number of observations equal to 50, Kallenberg and Ledwina take 
g?(50) = 10. We choose D tT = 6, D ct = 6, and we obtain the results in Table 
1. 

Table 1 

Estimated powers of the test of uniformity on [0, 1] for n = 50 with D tI = 6 and D ct = 6 



Alternatives f( p ,j) 



(p,j) 


Ttr 


Ttr/ct 


Tkl 


Tbr 


Tks 


(0.5,2) 


0.61 


0.56 


0.56 


0.48 


0.29 


(0.7,4) 


0.80 


0.77 


0.50 


0.71 


0.16 


(0.7,6) 


0.69 


0.62 


0.23 


0.60 


0.10 






Alternatives g( p 








(p,q,e) 




Ttr/ct 


Tkl 


Tbr 


Tks 


(3,3,1/2) 


0.55 


0.49 


0.53 


0.40 


0.14 


(10,20,0.25) 


0.46 


0.49 


0.36 


0.41 


0.33 


(2,2,0.8) 


0.62 


0.55 


0.63 


0.44 


0.15 


(2,4,0.5) 


0.57 


0.60 


0.55 


0.58 


0.64 






Alternatives /i( p ,j) 








Ttr 


Ttr/ct 


Tkl 


Tbr 


Tks 


(0.4,2) 


0.69 


0.65 


0.70 


0.59 


0.32 


(0.3,5) 


0.16 


0.16 


0.13 


0.14 


0.07 






Estimated levels 










Ttr/ct 


Tkl 


Tbr 


Tks 




0.051 


0.055 


0.061 


0.031 


0.050 
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Table 2 

Estimated powers of the test of uniformity on [0, 1] for n = 100 with D tr = 12 and 

D c t = 10 



Alternatives f( p j) 


(P,j) 




^"tr/ct 




Tbr 


Tks 


(0.5,2) 


0.87 


0.85 


0.87 


0.84 


0.53 


(0.7,4) 


0.98 


0.98 


0.83 


0.98 


0.29 


(0.7,6) 


0.97 


0.96 


0.46 


0.95 


0.19 






Alternatives g^ p 








(p,q,e) 


T tr 


T tr/ct 


7kl 


7"br 


Tks 


(3,3,1/2) 


0.83 


0.77 


0.88 


0.76 


0.35 


(10,20,0.25) 


0.77 


0.78 


0.62 


0.75 


0.60 


(2,2,0.8) 


0.90 


0.86 


0.95 


0.82 


0.36 


(2,4,0.5) 


0.87 


0.89 


0.88 


0.90 


0.91 






Alternatives hi p j\ 






(PJ) 




^"tr/ct 


7kl 


7"br 


Tks 


(0.4,2) 


0.93 


0.91 


0.95 


0.90 


0.60 


(0.3,5) 


0.33 


0.31 


0.23 


0.33 


0.09 






Estimated levels 








%r 


^tr/ct 


Tkl 


Tbr 


Tks 




0.050 


0.048 


0.056 


0.031 


0.054 



For a number of observations equal to 100, Kallenberg and Ledwina take 
c?(100) = 12. We choose Dt T = 12, D ct = 10, and we obtain the results in 
Table 2. 

Comments. In this simulation study, the alternatives that we consider 
are of three kinds. The alternatives f( p j) correspond to the uniform density 
contaminated by a cosine function. They are favorable to our tests and Bickel 
and Ritov's test since these tests are based on trigonometric polynomials. 
It is therefore natural to compare our power results with the results of Tbr. 
The main difference between the two procedures lies in the fact that 0(3 : d) 1S 
an unbiased estimator of the squared IL-2-norm of the orthogonal projection 
of / onto S^d); whereas {T n j) — D)/(n — 1) is an unbiased estimator of 
J2h=i(I f(x)V2cos(lnx) dx) 2 only under the null hypothesis. The consequent 
bias term under some alternative / may be of order D/n, which does not 
allow us to establish optimal uniform separation rates for the test proposed 
by Bickel and Ritov. This explains why the power of % T improves the power 
of 7br in all cases. 
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The alternatives h( p j\ are more favorable to the test due to Kallenberg 
and Ledwina since this test is based on Legendre polynomials. However, 
under the alternative fyo.3,5), the test % T improves the results of 7kl and 
under the alternative fyo.4,2)) the estimated power of T tr is comparable to 
that of 7kl- 

Since the functions 9( P! g :£ ) correspond to some "neutral" alternatives, we 
can focus on them. When the number of observations is equal to 100, the 
powers of our tests T tr and T tr / Ct are at least equivalent to the powers of 
7kl and 7br for half of the considered cases. As for the other cases, the 
procedures T tT and T tv / Ct are still more powerful than Tbr. For small sample 
sizes (n = 50), our test % r is always at least as powerful as the tests 7kl 
and T B r. 

4.2. Other goodness- of- fit tests. We are now interested in testing that the 
density / of the random variables X±, . . . , X n is a given density /o, with /o 7^ 
1(0,1] • To test such a hypothesis, we have two possible procedures: the first one 
consists in testing directly from the sample X±, . . . ,X n the null hypothesis 
"/ = /o" as explained in Section 2.1. The second one consists in testing that 
the common distribution of the variables Fq (X\ ),..., Fo(X n ), where Fq is 
the distribution function associated with the density /o, is uniform on [0, 1]. 
This approach is the one which is proposed in most papers. Whereas the 
two procedures are equivalent for the Kolmogorov-Smirnov test, they are 
not for our method based on the estimation of an L2-distance. Indeed, in 
our case, the first test is based on the estimation of ||/ — /oil I- Since the 
density of Fq(Xi) is given by h(x) = f (Fq 1 (x)) / /q(F q ~ 1 (x)) when Fq is one 
to one, the second test is based on the estimation of 

/ (h(x)-l) 2 dx=\\(f-fo)/^T \\ 2 2 . 

J[0,1] 

To compare in practice these two procedures, we have chosen to test that the 
density / is Gaussian with mean and with variance 1 first, with variance 
0.01 second. 

The choices we have made in order to implement our procedures are 
the following ones: for the direct test from X±,... ,X n denoted by 7^, the 
set M = {(l,D),l £ {1,2,3}, D £ V{\ is taken such that V 2 = V 3 = and 
V\ = {1, ... , 10}, and for the second test from Fq(Xi), . . . , Fq(X u ), we apply 
the test ^tr/ct described in Section 4.1 with D ct = 10 and Dt r = 12. We also 
present the estimated powers of the Kolmogorov-Smirnov test that we still 
denote by 7ks- 

We have taken a number of observations n = 100 and a level a = 5%. 
The quantiles are estimated as above with 40,000 simulations, the powers of 
the tests with 5000 simulations and the levels with 20,000 simulations. The 
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Table 3 
Estimated powers for n — 100 



Test of normality A/"(0, 1) Test of normality A/"(0, 0.01) 



Alternatives /, 



rn 


T d 


Ttr/ct 


Tks 


rn 


T d 


Ttr/ct 


Tks 


2 


0.96 


0.92 


0.62 


0.17 


0.93 


0.64 


0.24 


1.8 


0.66 


0.66 


0.36 


0.16 


0.87 


0.71 


0.14 




0.71 


1 


0.07 


0.12 


0.99 


1 


0.14 








Alternatives <7( mi(T 2) 








(m,<r 2 ) 


T d 


Tr/ct 


Tks 


(m,o- 2 ) 


T d 


Tr/ct 


Tks 


(1,1) 


0.80 


0.98 


0.77 


(0.1,0.01) 


1 


0.98 


0.77 


(0.5,2) 


0.66 


0.98 


0.70 


(0.05,0.015) 


0.91 


0.77 


0.35 


(1,2) 


0.97 


1 


0.97 


(0.05,0.02) 


1 


0.97 


0.68 








Alternatives h p 








P 


T d 


Tr/ct 


Tks 


P 


T d 


Tr/ct 


Tks 




0.24 


0.95 


0.42 




0.96 


0.95 


0.41 


3/2^2^ 


0.85 


1 


0.96 


15/^2^ 


1 


1 


0.96 








Estimated levels 










T d 


Tr/ct 


Tks 




T d 


Tr/ct 


Tks 




0.052 


0.051 


0.053 




0.053 


0.055 


0.053 



alternatives that we have considered are the following ones (see Table 3): 

frn{%) = ~ If— m.m]i 

zv lira 

hpix) = |eP% <0 + |e-P%> . 

Comments. The first objective of this simulation study is to compare 
our tests with the Kolmogorov-Smirnov test. The estimated power of the 
most powerful of our tests is larger than that of Tks f° r most of the consid- 
ered alternatives. In these cases, the difference in power is really significant 
as we can see, for example, for the alternatives f m . 

The second objective is to compare the tests 7^ and T tr / C f We can notice 
that we reject the null hypothesis more often with the test of uniformity 
from the i*b(JQ)'s when we test that the density is Gaussian with variance 
1, whereas the direct test performs better when we test that the density is 
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Gaussian with variance 0.01. As explained above, this is due to the fact that 
IK/ — /o)/v / 7olli i s larger than ||/ — /o 111 when /o is the standard Gaussian 
density but smaller when /o is the Gaussian density with variance 0.01. 

4.3. Testing a parametric family. We now implement the testing pro- 
cedure described in Section 3 in order to test that the density / of the 
observations X±, . . . ,X n is an exponential density or, in other words, that / 
belongs to the set of densities 

? = {f, f{x)=a- 1 e- x 'n x > , <r>0}. 

To simplify the implementation, we base our test statistic on constant 
piecewise functions instead of scaling functions. For all D in {2, . . . , 10}, we 
define 

T(i,d){Xi, ■ ■ -,X n ) 

n 

) I {x i / e 7e[fe/D,(fe+i)/-D[} I[ {x j / -e[fe/ J D,(fe+i)/D[} 

> k>0i=£j=l 

+ ll/.III-^|>(f)}, 

where fo is given by fo(x) = e~ x I x >o- It is easy to see that for all a > 0, 
T{i,D)(Xi/a, . . .,X n /a) = T( ljD ^(Xi, . . .,X n ). 

Let M. = {(1, D), D = 2, . . . , 10}. As explained in Section 3, we obtain a 
level a test as follows: denoting by t m (u) the (1 — u) quantile of T m (Xi, . . . , X n ) 
under the hypothesis "/ = fo," and setting 

u a = supl ue]0, l[,P/ ( sup (f m (Xi, . . .,X n ) - t m (u)) > ) < a 

we reject the null hypothesis if 

sup (f m (Xi, . . . , X n ) -t m {u a )) >0. 

The quantities u a and {t m (u a ),m S A4} are estimated by Monte Carlo ex- 
periments: 20,000 simulations are used to estimate the values of {t m (u), 
m G M} for u varying on a regular grid of ]0,a[, and 20,000 simulations are 
used to estimate the value of u a . 

We compare the performance of our test with that of the Kolmogorov- 
Smirnov test described below. Let 

n 

X n = n~ 1 Y,X l 

i=i 
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and let F n be the empirical distribution function defined for all t in R by 

n 



i<t- 



i=i 



Let F a be the distribution function associated with the exponential density 
with parameter u -1 : F a (t) = (1 — e _ */°")IL,>o. Under the hypothesis that the 
distribution of the AVs is exponential with parameter <t -1 , the law of the 
statistic D n = sup tgR \F n (t) — (t)\ is free of the parameter a. Let d n ^ a 
denote the (1 — a) quantile of D n under the assumption that the AVs have /o 
as common density. The Kolmogorov-Smirnov test of exponentiality consists 
in rejecting the null hypothesis "/ G JF" if D n > d nt i- a . We consider the 
alternatives defined for x > by 

(p even), 



9 P {x) 


= (e~ x + (1 + sin(p7rx))I 0<:c <i)/2 


h p {x) 


= (e- x + (1 + cos(pirx))I 0<x< i)/2, 


k{p,q,e) i x ) 


= (l-e)e~ x +e(3 p>q (x), 




= (l-e)e~ x + £7 Pi g(x), 


t(x) 


= e -(logx) 2 /2 /(x ^ )) 


v(x) 


= v^e^ x,/2 /(2 3/2 r(3/2)), 


w(x) 


= 1.5x°- 5 e- xl \ 



where (3 p ^ q and 7 Pi9 , respectively, denote the Beta density and the Gamma 
density with parameters (p,q). 

For each alternative, the power of the test is still estimated by 5000 sim- 
ulations. The levels are estimated by 20,000 simulations. We choose n = 100 
and a = 5%. 

Table 4 presents the estimated power of our test denoted by 7^ and of 
the Kolmogorov-Smirnov test denoted by F^ s . 

Comment. We can see in Table 4 that our test is not always much 
more powerful than the Kolmogorov-Smirnov test under very smooth alter- 
natives like v and w which respectively correspond to the chi-square with 
three degrees of freedom and the Weibull with parameter 1.5. However, 
under oscillating alternatives like g p , h p , fc( Pl g |e ) and Z( Pig|£ ), for which the 
Kolmogorov-Smirnov test is known to fail, our test performs much better. 
We could furthermore expect better results for our procedure with regular 
scaling functions instead of constant piecewise functions that we have chosen 
to make the implementation easier. 



5. Proofs. 



ADAPTIVE GOODNESS-OF-FIT TESTS 



23 



Table 4 

Estimated power of the test of exponentiality for n = 100 



Alternatives g p 




Alternative i 


P 1~' a 


r' 

KS 


'v KS 


4 0.89 


0.74 


0.75 0.45 


Alternatives hp 




Alternative u 


P T' a 


-r' 

■* KS 


^"a ^"ks 


4 0.71 
1 1 


0.60 
0.90 


0.67 0.65 


Alternatives fc( P]9 




Alternative to 


(p,q,e) T' a 


1 KS 




(10, 20, 0.25) 0.91 


0.65 


0.97 0.98 


Alternatives l( p , q , 




Estimated levels 


(p,q,e) T a 


^"ks 


T a ^"ks 


(2, 5, 0.5) 0.53 
(2, 5, 0.75) 0.89 


0.28 
0.60 


0.053 0.051 



5.1. Proof of Theorem 1. The main tool of the proof is the canonical 
decomposition of the [/-statistics 9 m defined in Section 2.2. We introduce 
the processes U n and P n defined by 

2 n \ n 

Un(h) = - Yl h(Xi,Xj), P n (h) = -Y J h(X l ). 

n(n — 1) . rr". n f— f 

We also define -P(/i) = (h,f). Using the same notation as in Section 2.2, let 
for D in N*, J in N, 

(i)-C(i,c)={°} xZ an d {K>i^(i,o)} = fe' ie2 }> 
(5.1) (ii) £ (2)2 J) = { J} X Z and {pj, Z € £(2,2')} = {</?./,&> k 6 Z}, 

( ui )£(3,D) = {0,1,..., £>} and {p h leC(3 )D )} = {gi,l = 0,...,D}. 

By setting, for all m £ A4, 

H m (x,y)= (pi(x) -ai)(pi(y) -at), 

with ai = {f,pi), we obtain the decomposition 

§ m = U n {H m ) + (P n - P)(2U Sm (f)) + \\U Sm (f)\\l 
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Let us fix some (5 in ]0, 1[. Recalling that 

P/(T a <0)=P/[ sup (L + \\fo\\i--f2fo(X i )-t m (u a )) <0], 
\meM\ n. =1 J J 

we have 

(5.2) F f (T a < 0) < inf P, (§ m + ||/ ||| /oPQ) - *m(«a) < J . 

Since ||/- n Sm (/)||l = II/IH -||n 5m (/)||I, 

P/(T a < 0) < inf ¥ f (U n {H m ) + (P n - P)(2n 5m (/) - 2/) 

m£.A/f 

(5.3) +(P n -P)(2/-2/ ) 

+ \\f-M\l<\\f-n Sm (f)\\l + t m M). 

We then need to control U n (H m ), (P n - P)(2U Sm (f) ~ 2/), (P„ - P)(2/ - 
2/o) for every m in jVl. 

(a) Control of U n (H m ). We use the following lemma, which derives from 
an exponential inequality for [/-statistics of order 2 due to Houdre and 
Reynaud-Bouret [11]. 

Lemma 1. Let X±, . . . ,X n be i.i.d. real random variables with common 
density f G Loo(R). Let Di = D 3 = N* and D 2 = {2 J , J G N}. For a// m = 
(Z,-D) wrfa Z G {1,2,3} and D G Bj, introduce {pi,l G £ m } defined as in (5.1) 
and 

1 n 

■^m = — 7 TT 7> H m {Xi,Xj), 

n(n - 1) . fr^, 

H m {x,y) = (^( x ) ~ (f>Pl))(Pl(y) ~ (f>Pl))- 

m 

There exists some positive constant C ( depending only on (p) such that, for 
all I G {1,2,3}, D£B l; x>0, 

(5.4) F(\Z m \ > £ (v^dl/IU + ^fillZ > ll/lloo* + ^-)) < 5.6e-*. 
The proof of this lemma is detailed in [9] . 

By setting A = log(3//3) and A = A + log(5.6), Lemma 1 gives that there 
exists some constant C > such that, for all m = (l,D) in A4, 

(5.5) P / ^[/ n (F m )<-^(||/|| 00 + V t^)V^+||/||ooA+^)) <f- 
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We deduce from (5.3) and (5.5) that 
P f (T a < 0) 

< | + inf h f ((P n - P)(2U S(l (/) - 2f) + (P n - P)(2f - 2/ ) 

(5.6) 

+ 11/ - Ml < 11/ - n S(JiD) (/)||^ + % D) K) 

+ 1 ((ll/lloo + v^mU )V^+ 11/llooA + ^)) }• 

(b) Control of (P n - P)(2U Sm (f) ~ V) and (P n - P)(2f - 2/ ). We 
now use the following lemma due to Birge and Massart [5], which provides 
a special version of Bernstein's inequality. 

Lemma 2. Let X%, . . . ,X n be independent random variables satisfying 
the moment condition 

1 n h\ 

-Ym\Xi\ k ) < -vc k ~ 2 for all k> 2, 

n f— \ 2 

i=i 

for some positive constants v and c. Then, for any positive x, 

P (i£(X i -E M )>^ + -)<e-'. 

In particular, if for all i in {l,...,n}, \Xi\ < b and ¥,(Xf) < u, the above 
inequality is satisfied with c = 6/3. 

It is easy to check that there exists some constant C > such that for all 
I in {1,2}, D in B h 

\2U S(LD) (f)(X i )-2f(X l )\<CV\\oo. 

Moreover, it is proved in [7], page 269, that one can take C such that for 
all D in D 3 , 

\2IL S(3D) (f)(Xi) - 2f(X i )\ < C'H/IU \og(D + 1). 

Since 

E(2n Sm (/)(x i ) - 2f(x t )f < 4||/|| 00 ||n 5m (/) - f\\l 

we can deduce from Lemma 2 that for all m = (l,D) GM, 

p f (Vn - p)m Sm (/) - 2/) < -2 V ^munn 5m (/) - /n 2 

c^muiogCD+i) \ < p 

3n J ~ 3 ' 
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By using the elementary inequality 2ab < 4a 2 je + £& 2 /4, we obtain that for 
m=(l,D)e M, 

F f ((P n - P)(2U Sm (f) - 2f) + E -\\U Sm (f) - f\\l 

(5.7) 

/8 cqog^ + mii/iiooA N p 

\e 3 ) n ) ~ 3' 

The control of (P n — P)(2f — 2/o) is computed in the same way and we get 

(5.8) 



» / ((P n -P)(2/-2/o) + i ||/-/ ||^ 



<-(2(| + i)||/|u + |||^u)^<f 

Finally, we deduce from (5.6)-(5.8) that if there exists some m = (l,D) in 
M. such that 

i-j)ll/-/o||l>(i + |)ll/-n5 m (/)||^ 

C / / / DA 2 

+ - (ii/iu + xArnu wdx + ii/iua + — 



oo j V J^^ 1 ! \\J ||oo / 
fl\ v n 

then 

P/(T« < 0) < 0. 
This concludes the proof of Theorem 1. 

5.2. Proof of Corollary 1. Assume that for all I in {1,2,3}, T>i = or 
{2 J ,0< J<log 2 (n 2 /(loglog?i) 3 )}. 

5.2.1. An upper bound for t m (u a ) , mEAi. 

Proposition 1. There exists some positive constant C(a) such that, 
for all m = (I, D) in M , 

t m (u a ) < W m (a), 

where 

W m (a) = ^ f (H/olU + V^)v/^lo^ + Z3(lQglQgn)2 
n \ v n 



rolloo 



(log log n) logn . 
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Proof. Recall that t m (u) denotes the (1 — u) quantile of the distribution 
of T m under the hypothesis "/ = fo" We first notice that for n > 16, \A4\ < 
3(1 + log 2 n 2 ). So, setting a n = a/(3(l + log 2 n 2 )), 



fo 



rneM 



sup (f m -t m (a n ))>0)< J2 F .fo(T m ~ t m (a n ) > 0) 



meM 

a 



^ + logo n T 

rneM v 



<a. 



By definition of u a , this implies that a n < u a and for all m in A4 , 

It thus remains to give an upper bound for t m {a n ). Let m = (l,D) £ M. We 
use the same notation as in the proof of Theorem 1 to obtain the decompo- 
sition 

f m = u n (H m ) + (p n - p)(2n Sm (/)) - 2P n (/ ) + \\M\l + \\u Sm (f)\\l 

Under the null hypothesis "/ = /o," 

f m = U n {H m ) + (P n - P)(2n 5m (/ ) - 2/ ) - \\M\l + ||n 5m (/o)||^ 

Since ||n Sm (/ ) - /o 111 = ll/olli ~ ll n s m (/o)||2 5 w e obtain that, under "/ = 
fo" 

(5.9) f m = U n {H m ) + (P n - P)(2n Sm (/ ) - 2/ ) - ||n Sm (/ ) - /oil'- 

As in the proof of Theorem 1, we control U n (H m ) from Lemma 1 and (P n — 
P)(2iI 5m (/ ) - 2/ ) from Lemma 2. We set X n = log(2/a n ) and X n = X n + 
log(5.6). 

On one hand, Lemma 1 leads to 



C 



¥ f JU n (H m )>-l V^dl/olloo + Vll/olloo) 
(5-10) 

+ ,|/ ,UA„ + ^))^ 

On the other hand, since 

(5.11) |2(n 5m (/ ) - f )(Xi)\ < C'H/olloclog^ + 1) 
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and 

E /0 (2(n Sm (/ ) - fo)(X t )) 2 < 4||/o||oo||n Sm (/o) - fo\\l 
it follows from Lemma 2 that 



/0 (p n - p)(2n Sm (/ ) - 2/ ) > 2 v /||/o||oo||n 5m (/o) - h\\ 2 



2A n 
n 



3n ) ~ 2 ' 

Using the inequality 2ab <a 2 + b 2 , and the fact that for n > 16, log(Z? + 1) < 
log(n 2 + 1), we obtain that there exists C" > such that 

P /o ((P n - P)(2U Sm (f ) - 2/0) - ||n 5m (/ ) - f \\l 

(5.12) 

C^H/ollooAnlogn N < On 
n ) ~ 2 ' 

We derive from (5.9), (5.10) and (5.12) that 

P /o (f m > ^ UDl n (\\M\oo + x/ll/olloo ) + H/ollooAn + ^) 

, C" || /o|| 00 A re log w 



< OL n . 



Finally, we notice that there exist some positive constants c(a) and c'{a) 
such that for n > 3, A n < c(a) log log n and A n < c'(a) log log n, which com- 
pletes the proof of Proposition 1. □ 

5.2.2. Uniform separation rates. Let us fix (3 in ]0,1[ and I in {1,2,3} 
such that T>i = {2 J ,0 < J < log 2 (n 2 /(loglogn) 3 )}. From Theorem 1 and 
Proposition 1, we deduce that if / satisfies 

11/ " /oil! > (1 + e) vriiWf - Us {LD) (f)\\l + W {ljD) (a) + V {l , D) (0)}, 

then 

P/(r a <o)</3. 

It is thus a matter of giving an upper bound for 

jnf 0/ - n S(i)D) (/)||^ + W m (a) + V m (P)}, 

when / belongs to some specified classes of functions. Recall that 
B®(R,M) 

= {/eL 2 (R), VL>eB z , \\f-Yl S[lD) {f)\\l<R 2 D- 2 \ ||/|U<M}. 
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We now assume that / belongs to B$\r,M). Since \\f - U S([D) {f)\\l < 
R 2 D~ 2s and since the constant Ci{fi,e, ||/||ooj ||/o||oo) m Theorem 1 can be 
taken such that C 2 (/3,e, ||/||oo, ||/o||oo) < C 2 ((3, e, M, 1 1 /o 1 1 oo ) , we only need 
an upper bound for 



D 



inf {b?D- 2s + C 1 (J3)(VM + M) 
+ C m ° + C(a) D{lOSl ° gn)2 



n 2 n 2 



\J D log log n 



+ C(«)(||/o|U + v / ll/olloc) 

, nt mi jr II (loglogn)logn C 2 ((3,£,M,\\f \\ oo )\ 
n raj 
Assuming that n > 16, this quantity is bounded from above by 

-D(log logn) 2 



+ (C 1 (/3)(v / M + M) + C(«)(||/o|U + x/ll/olloo )) v 



+ (CH||/o||oo + C 2 (Ae,M ) ||/ || 00 )) 



n 

(log log ra) log n 



n 

Since every D in T>i is smaller than ra 2 /(loglogn) 3 , 

inf {r 2 D~ 2s + ^ Dl °S l °Sn £>(loglog?i) 
DeVi\ ra n 2 

< 2 inf \r 2 D- 2 ° + ^ i0gl0gn 

We have R 2 D~ 2s < log log n/n if and only if D > (i^nVloglogn) 1 ^ 1-1-4 ^ , 
so we define by 

log 2 (A.) = [^((^^/(loglogn)) 1 ^ 1 ^)] + 1, 

and we consider three cases. 

The first one is the case where 1 < < 2[ log2 ( n /( lo s lo s n ) )] ; which means 
that D* G T>[. In this case, we have that 

inf { R 2 D- 2 ° + ^ l0gi0gn l < R 2 D- 2 * + V ^ l0gl ° gn 



De^i I ra n 

Since 

* \ n 
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and 



y/D* log log n < ^/ nR 2 \ 1 /( 4s+1 Vloglogn 
n ~~ V \/l g 1°§ n J n 



we obtain that 

dgx>, t n J V n J 

The second one is the case where > 2[ log 2( n2 /( 1 °g 1 °g»' 1 ) 3 )] . In this case, 
for all D in Di, 



y/D log log 



<R Z D 



2 n-2s 



By taking D = 2^( n /(ioglog«) 3 )] ) we obtain that 



inf <^ R z D' Zs + 



3\ 2s 

2 n-2s 



< < 2 2 S+ WQogy 3 Y 

n J \ n z J 



The third one is the case where .D* < 1. In this case, for all D in T>i, 
R 2 D~ 2s < y/D log log n/n, so by taking D = 1, we obtain that 

inf «fi? 2 Z5~ 2s + ^- Ploglogn ] < vjog^gn 
£>ex> ; \ n J - n 

This completes the proof of the corollary. 

5.3. Proof of Corollary 2. We use the same notation as in Theorem 1. 
We assume that /o = I[o,i] and Ai = {(1,D),D £ Pi}, which means that 
we only consider spaces generated by constant piecewise functions. We first 
prove that if there exists m in Ai such that 

(5.13) ||n Sm (/) - jTollI > (1 + e)(t m (u a ) + V m (P)), 

then 

P/(T a <0)</3. 

Using inequality (5.2) and the definition of m , we derive that 
P/(T a < 0) < inf F f (u n (H m ) + (P n - P)(2U Sm (f) ~ 2/o) 



+ ||n Sm (/) - /oils + 2 jT 1 / (/ - n 5m (/)) < u«a)) . 
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Since /o = I[o,i] and since, for all m in M, S m is generated by constant 
piecewise functions, we have 



/ 1 /o(/-n Sm (/)) = o. 

Jo 



We then obtain that 



P/(T a < 0) < inf F f (U n (H m ) + (P n - P)(2n Sm (/) - 2/ ) 

+ l|n 5m (/)-/o|l2<im(n Q )), 

and we conclude as in the proof of Theorem 1. 

Let for all s > and R > 0, H S (R) be defined by (2.4). Ingster [14] proved 
that for all functions g in H. S (R), for all D in N*, 

\\n s ^ D) { g )\\l>c x \\ g \\l-c 2 D- 2 \ 

where C\ e]0, 1[ and C 2 > (see [14], part III, inequality (5.16)). 

Assume that / G H S (R). Since /o = H[o,i] 5 /-/o£ H S (R). We then derive 
from Ingster's inequality that for all -D in Pi, 

l|n S(1 , D) (/ - /o)||^ > CiH/ - /o||l - c 2 d- 2s . 

Moreover, Ils (1 D (/ — /o) = IIs (1 D) (/) — /q. As a consequence, if there exists 
m = (1,.D) in .M such that 

Ci||/ - /0II2 > C 2 £- 2s + (1 + e)(* m (« a ) + K m (/3)), 
then (5.13) holds. The end of the proof is similar to the proof of Corollary 1. 

5.4. Proof of Theorem 2. Let T a be the test statistic defined by (3.4). 
We have that 

P/(T Q <0)< inf F f (T m (X 1 ,...,X n )< 

m£A4 

and since V 2 + 0, setting M' = {(2,D), D £ V 2 }, 

(5.14) F f (T a <0)< inf F f (f m (X 1 ,...,X n )<q m>a ). 

Let us introduce some notation. Using the definitions given in (5.1), for m 
in M' , we recall that 

^)=^e j>(^M^ 

+iiaii-|ea(^ 
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and we set 

ai(n,a) 

b(n,a) 



Pi 
' So 



X — LI 

a 

x — li 
a 



n 



(n-1) 



f(x)dx, 
f{x)dx, 

Pi 



Pi 



a 



L m (Li,a) = (P n - P)(2 ai(ii,a)pi 



a 
a 



ai(fi,a 
-2/o 



■A* 



a 



The following identity holds: 



Xi-jx X n - li\ _ 



T m ( — l — ~ , ■ • ■ , — "J = ^m(At, cr) + £mO, O") + ||/o||l 

leC ■m 

Recall that S m is the linear subspace of 1L-2(M) spanned by the functions 
{pi , I £ C m } and that Iis m denotes the orthogonal projection onto S m in 
L 2 (R). Then 

(5-15) J2 a 2 l {^o) = \\Tls m {°f{°- + M\l 

m 

After some easy computations, one can prove that 
II/0II2+ E af(f,,a)-2b^,a) 



■ a 



• - A* 



-/ 



af(a. + Li)-U Sm (af(a. + ^)\\l. 



This implies that 

F f (f m (Xx,...,X n )<q mt a) 

= P/( inf ^ Z m (Li,a) +L m (fi,a 



• - A* 



CJ V (J 



- |k/((T. -n Sm (<T/(<T. +At))|l2j < ?m,aj, 
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or 



^f(T m (Xi, . . .,X n ) < q m ,a) 



M inf \ Z m (fj,,a) + L m (n,a) + -a 
\(p,<r)eK I 4 



cr V cr 



+ l\\af(a.+fj,)-U Sm (af(a.+f,))\\l 



fo 



/' 



,, cr V cr 



1 + - ) \\af(a. + fi) — U Sm (af(a. + fi))\\ 2 \ < ) • 



Therefore, 



<P/( sup \-Z m (n,o) - L m (fi,a) 



f||a/(a. + /x)-n Sm (cr/(a. + M ))^} 



> [ 1 inf a 

A J MeK 



a \ a 



/' 



I + 7) sup \\af(a. + fi)-U Sm (crf(a. + fi))\\l - q m ,a ) • 
A) (jj,,a)eK 

Let r m (/3) denote the (1 — j3) quantile of snpf^ a \ eK T m (n, cr), where 



r m (n,a) = -Z m (/i,cr) - L m (n,a) - -a 



-fo 



/ 



- £ -\\af(a. + fi)-ns m (°f(v-+ml 
It follows from the above inequality and from (5.14) that if 



1 - - inf a 

A) MeJf 



1 



cr 



-/o 



cr 



/ 



> inf ((1 + 1) sup \\of{a. + ii)-n Sm {af{cT. + n))\\l 
meM'W A) (p >a -)eK 



+ T m (f3) + 
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then 

P/(T a <0)</3. 

Let m = (2, 2 J ) G M' . We now need to compute an upper bound for T m {(5) . 
To compute this upper bound, we introduce a finite grid on K = [fi,~p] x 

Let /xo < fJ-i < ■ ■ ■ < fJ-N and cjq < a\ < ■ ■ ■ < a^ such that /xo = f£, = ~P, 
o"o =QL, = for all S in {0, . . . , N-2}, \fj,g+i - Hs\ = A M , \a&+i -ag\ = A CT 
and | [i 7v — /Utv-i| < A^, |er/v — o"tv-i| < Ao-. A At and A CT will be chosen later. 
Let for (5,5') in {0, . . . , N - l} 2 , 

A5,<5' = [fJ*8, fJ-6+l] x k<5',0-<5'+l]- 
The following inequality holds: 

sup T m (fi,a)< sup T m (fj, 6 ,a &l ) 
{fj.,v)eK (5,5')e{o,...,Ar-i}2 

+ sup sup (r m (jU,<T) - T m (fi s ,a S ')). 

(5,6>)£{o,...,N-iy MeA ttS , 

5.4.1. Control o/sup^^g^ iA r-i} 2 r m (//5, cy). Introduce 

L« ( M , a) = (P n - P) (2a f - 2(U Sm (af(a. + /,))) 



£ -\\af{a. + ^)-Jl Srn {af{a. + ^))t 2 



and 



a \ a 



2 



Since 



L m (/i, a) = (P„ - P) 2a f - 2(U Sm (af(a. + fi))) 



a 



(P n -P)l2f (—^)-2af), 



we have that 



sup T m (n & ,a S ') 
(6,6')£{0,...,N-1} 2 

< sup \Z m (jj, s ,as>)\ 
(5,5')6{0,...,/V-l} 2 

+ sup L^(/j, s ,a S ') + sup L$(fjL S ,as')- 
(<5,<5')e{0,...,JV-l} 2 ((S,<5')e{0,...,jV-l} 2 
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(a) Control of sup (<5(5 , )e { 0j ...^-lp \Z m (ns,crs')\. We apply Lemma 1 by 
replacing the Xj's by the variables (Xj — fi)/a and the density / by the 
density of the (JQ — /i)/cr's that is af(a. + /u). With m = (2, 2 J ) and Z G 
£m} = {^J,fc 5 fceZ}, we obtain that there exists some constant C > such 
that for any (^,cr) in if, for any x > 0, 



z m (/i )f 7)|>^(: 

< 5.6e _a 



~jX -\ 



Hence, 



(5.16) 



sup | Z m (^5,0-5/ 
(<5,5')e{0,...,7V-l} 2 



c 



n 



>- 2 J / 2 v^(a 



< 5.6iV 2 e- x . 



(b) Control of sup^/^jQ v .. ) jv-i} 2 Lm(ps, a S')- To get an upper bound 
for this supremum, we use Lemma 2. We recall that for m = (2, 2 J ), I G 
An} = {(pj t k,k G Z}, where 99 = 2 J / 2 (/?(2 J • — A;) and </? is a compactly 
supported scaling function. Then, for all I in C m , 

||p;||oo<2 J/2 ||v?|| 00 . 

Assume that the support of the function cp is included in [— $, $], with $ > 0. 
This implies that for any x in K, the cardinality of the set {/ G C m ,pi(x) 7^ 0} 
is smaller than 2$ and that for all (fJ>,cr) in K, I in C m , 



~jX H x x 

n 



ai 



(H,a)\ <2$d2- J / 2 



Hence, for all x in M, 
(n Sr >/(a. + //))) 



x — [l 



x — 11 
a 



< 2$cr2~ J/2 



00 1 1 V 1 1 00 ^ ] 



x — 
a 



< 4$ 2 a\\f WML- 
Therefore, there exists some constant C > such that 



2af-2(U Sm (af(a. + ^))) 



<C"a 
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Furthermore, we have 



2af-2(U Sm (af(a. + ^))) 



/' 



a 



f 



< Aa\\f\U\af(a. + fi) - U Sm (af(a. + 

Then, for all (//, a) in K, Lemma 2 gives the following exponential inequality: 
for all x > 0, 



7 



(P n -P)(2<7/-2(n Sf >/(<7. + p))) 
Call/Hoc x 



• - A» 



> 



+ 2 v /2^mU||a/(a. + M ) - n 5m (a/( C T. + ^))ll 2 ^) < ^ ■ 
By using the elementary inequality 2ab < 4a 2 /e + e6 2 /4, we obtain that 



(5.17) F f 



sup 

(5,5')e{0,...,7V-l} 2 



L$(/w) > ( - + ^ )^ll/lloo- ) < N\~\ 



8 C" 



e 3 

(2)/ 



n 



r2 —x 



(c) Control of sup( 5 5 /) g { i ... i jv-i}2 L { m '(fi S , a s >). We control 

(2) 

SU P(<5 <5')e{o TV-i} 2 Lm (H5,&8>) in the same way, noticing that for any (/i,cr) 
in if! 



/' 



a 



2af 



and 
We get 
(5.18) 



2/c 



2a f) f< te- 



lle 



a 



SU P L mHv6,<76') 
(<5,<5')6{0,...,JV-1} 2 

>2(^||/||« + J!*^ 



<2(||/ ||oo + ff||/||oo) 



1 1 (--v- 

~/o 



a \ a 



f 



n 



- )<N 2 e- x . 



Assume that N>2. Collecting the inequalities (5.16), (5.17) and (5.18) 
and choosing x such that 7.6N 2 e~ x = (3/2, we obtain that there exists some 
positive constant C = C(a,e, \\f\\oo, ||/o||<x>>/3) such that 



7 



(5.19) 



sup T m (iJ, s ,a s >) 
(5,S')e{0,...,N-l} 2 

. c ( 2 Jl2 ^iN | logiV | 2 jlog 2 N\\^ 
n n n 2 J J ~ 2 
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5.4.2. Control o/BUP(a ji /) 6{0| ... l JV-l} a ,0*,«r)6i4 4)< ,( r m(M.^)- r m(^.O**'))- 
have 



r m (Ai,cr) 



1 



n(n — 1) 



E E 



e i 
4 1 



Xi-/jL\ {Xj-fJ, 

Pi 



a 



a|W) - -\\af{a. + li)-n Sm {of{a. + i£))\y 2 

m 

2 



e 

r 



i/o 



cr \ cr 
Using (5.15), this implies that 



/ 



1 



n(n — 1) 



E E 



+ (P re -P) 2/, 



1 1 



a 



a 



a 



2 "* " z 4 
+ (P n -P)f2/ 



5HI/III-7II/0III+ 1 + 7 E aiW) + ^>(/^ 



/' 



Let 



r^(/i,<j, 5, 5') 



1 



n(n — 1) 



x E E H : 

-Pi 



(Xi-fx\ fXj-n 



a 



Pi 



X{ — fig\ ( Xj — 11s 
Pi 1 



cr, <5, <5') = b(p,a) - b(fj, s ,a gl ), 
T$(f,,a,5,5') = (P n -P)(2f (- 



2/o 



P<5 



cr<5' 



Then 



sup sup (r m (fi,o) -T m (fi5,^6')) 

(5,5')e{0,...,7V-l} 2 ( ftff )eA w , 

< sup sup \T$(/j,,a,5,5')\ 
(6,5') MeAf tl , 
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+ ( 1 + 7) SU P su P {(!>,<?, 8, 8')\ 



(5,5') M&A S 5 , 



+ ^sup sup \T®(n,a,8,8')\ + sup sup \T$(fj,,a,8,8') 



(5,5') (^)&A S j, 



(5,5') (/i,<7)e4j, 



(a) Control of sup (5)5/)e{0) ... )JV _ 1} 2Sup (M)O . )eA(S5; \T^(fi,a,8,8')\. One can 
easily see that 

T^({i,a,8,8') 
1 



n(n — 1) 

n 

x E E 



Pl[ 



+ Pi 



V cr 



Pi[ 



P/ 



Pi 



X 3~ V5 \\ m ( X i- M 



(T$i 



a gi 



We recall that for all in R, the cardinality of the set {I £ C m ,pi(x) ^ 
Pl(y)} is not larger than 4$ and that for I in C m , 

IN|oo<2 J / 2 |M|oc. 

Since (p is a Lipschitz function with Lipschitz constant CL, we also have that 
for all x, y in R, 

E \Pi(x)-pi(y)\<^2 ZJ l 2 C v \x-y\. 

m 



(5.20) 
This implies that 



9 2J n Y . — i, Y- — 

«=1 



(7 



C<5' 



and 



< 



Assuming that (/12) holds, we derive from Lemma 2 that for all x > 0, 



(5.21) 



V E(l^|- E (l^l))^^z^ + cx <e~ x . 
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Since E(|Xj|) < y/v and N >2, by taking x = log(4N 2 /(3), we obtain that 
there exists some constant C(fj,,~p,a,a,v,c, (3) > such that 



(5.22) 



sup sup \T^(fi,a,S,S')\ 

(5,s>)e{o,...,N-iy (»,a)eA M , 



> C(/x,7Z,a,a, v, c,/?)2 2J (A At V A CT ) logiV ) < 



(b) Control of sup (5 >5 , )G{0 jv-i}2 sup^g^, \T$ (/j,,a,5,6')\. Since 

r m (A*) °") ^ *0 = {ai(fi,a) -ai(fi 5 ,a S ')){ai(n,a) + ai(fi s ,crs')) 

m 



4' 



and \ai(fJL,a) + aj(/x 5 , cr 5 /)| < 2.2 J / 2 ||<^|| 00 , 



\rg\n,a,5,5')\<2.2 J / 2 M\ 00 f £ 

leCn 



Pi 



x — fi 



a 



Pi 



x- us 
a si 



f(x)dx. 



By (5.20), we obtain 
\r$(p,a,6,6')\ 

< 8*11^00^2 



2.1 



x — i± x — [IS 



a 



os' 



f(x)dx, 



< 8<t>MooC^2 2J J (\x\^ + i(aA, + (|/l| V A ff )) /(*) dx. 

We deduce from (/J2) that there exists some constant C(fi,JI,a,a : , v) > such 
that 



(5.23) 



sup sup \T$(/j,,a,6,6')\ 

(<5,5')6{0,...,7V-1} 2 (n,cr)eA SiS , 

<C(fi,-p,a,a,iy)2 2J (A^\/ A CT ). 



(c) Control of sup( We { 0)< „ )JV _ 1} a sup^g^, |IV (/i, a, 5, Assum- 
ing that (h\) holds, we have that for any (fi,cr) in Agg', x in the support of 
/, 



(5.24) 



fo 



X — [1 

a 

<c f0 



fo 



x- lis 
as* 



(^)+^(aA /1 + (|/l|vy)A CT ; 
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Hence, we derive from (/12) that there exists some positive constant C(/i, fi,a, a, 
Cf ,u) such that 



(5.25) 



sup sup \r$(fi,cr,5,5')\ 

(5,5')G{0,...,7V-1} 2 (n,o-)eA SiS , 

<C(/x,p,<x,<7,C /o ,z/)(A^VA CT ). 



(d) Control of sup (5 5 , )e{0r . .^-lp ^P{n,a)eA SiSl l r m 0", <x, 5, S')\. It fol- 
lows from (5.24) that there exists C(/x,/J, a, a, C/ ) > such that 



SU P \r$(n,(r,5,5') 
(/i,o-)eA 6{ , 



1 n 

-^(|X i |+E(|X i |) + l)(A M VA CT ; 
n r— : 



Using again (5.21), we prove that there exists some positive constant C(/U, /j,,a, a, 
v,c,f3,Cf ) such that 



7 



(5.26) 



sup sup |r^(^,o", 5, 5')\ 

(<5,<5')e{0,...,7V-l} 2 (a^)6A m , 



> C% 71, a, a, i/, c, 0, C /o ) (A„ V A CT ) log ivj < I . 
Collecting (5.22), (5.23), (5.25) and (5.26), we get 

Pi- 



sup sup (T m (fj,,a) -T m (fj,s,os')) 

(5,5')6{0,...,7V-1} 2 (»,<T)eA M , 



>C2 2J (A^A a )logN)<^ 



(5.27) 



for some constant C = C(/x, fi,a,a,Cf , ||/|| 00 ,i', c,/3,e) > 0. Finally, by set- 
ting A^ = A a = n~ 2 2~ J , we have that 



n 



2 2 J (JI - //) A (ct - a) < N < n 2 2 J (jl - fj,) V (a- a) + 1. 



Hence, we deduce from (5.19) and (5.27) that if n 2 (7J— /x) A (a — a) > 2 and 
n > 2, there exists a constant C = C ([/,,]!, a, a, ||/o||oo) C/ > \\ f Woo, u,c,P,e) 
such that 

'2 J / 2 / , , — 2 J . o, o_ r . log(n 2 2 J ) 



/oJ/2 , oJ 

r m (/3) < C V lo g( n22J ) + -logV2 J ) + 

This concludes the proof of Theorem 2. 



n 
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5.5. Proof of Corollary 3. Since <j(2,D),a = t(2,D){ u a) with the same no- 
tation as in Section 2, we have that 

Q(2,D),a < W D {a), 

where 



W D {a) = ( (H/olloo + ^/ll/ollooVDloglogn 

■D(loglogra) 2 A 

H V /o co (log log n) , 

n J 

C(a) being a positive constant. This upper bound is obtained by replacing 
in the proof of Proposition 1 a n by a/2(l + log 2 n 2 ) and C"||/o||oolog(.D-|-l) 
by C'll/olloc in (5.11). 

Let / belong to B S (R,M) and satisfy (hi) and (/12). 

From Theorem 2 and the above upper bound, we deduce that there exists 
some positive constant C = C(fi,~p,a,a,Cf , ||/o||oo, M, u, c, a, 0, s) such that 
if / satisfies 



inf 

(fJ,,<T)&K 



/--/( 



1, f-y> 2 



. , 

a \ a 



2 Dev 2 I n 



D\og 2 {n 2 D) \og{n 2 D) 



+ — 2 - + 

then 

P/(T a <0)</3. 

Since V 2 = {2 J ,0 < J < 2[ lo S2( n2 / lo s 3n )]} and n > 3, there exists c> such 
that 



f (r 2 D- 2s + v^M^) + fllogVZ?) { log(n 2 D) \ 
DeT> 2 \ n n 2 n J 

<c( inf W^ + ^^l + ^Y 
\D£V 2 { n ) n J 

and Corollary 3 can be proved in the same way as Corollary 1. 
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